Proceedings of the DPF-2011 Conference, Providence, RI, August 8-13, 2011 



Study of Tau Leptons from Top Quark Pairs in ATLAS 



A. McCarn, On Behalf of ATLAS 
Department of Physics, University of Illinois, Urbana-Champaign, IL, USA 



Top quark pair decay events resulting in final states containing $\tau$ leptons present interesting
channels in the search for physics beyond the Standard Model. This document describes an analysis
of $\tau$+jets in $t\bar{t}$ decays, for the case of a hadronically decaying $\tau$ lepton. The studied events are from
1.03 fb$^{-1}$ of proton-proton collision data at $\sqrt{s} = 7$ TeV recorded by the ATLAS Collaboration at the
LHC. The goal of the analysis is to look for deviations from Standard Model expectations. Exclusion
limits are produced for the product of the branching ratios $BR(t \to H^+ b)$ and $BR(H^+ \to \tau\nu)$
as a function of the charged Higgs boson mass, and also in the $\tan\beta$-$m_{H^+}$ plane in the $m_h$-max
scenario of the MSSM.

I. INTRODUCTION



The decay of top quark pairs is a significant source of $\tau$ leptons at the LHC. Furthermore, top quark decays to
$\tau$ final states present an interesting channel in the search for new physics. In particular, it is possible to set
competitive exclusion limits on the light charged Higgs boson, a particle predicted by many non-minimal
Higgs scenarios [1, 2], such as Two-Higgs-Doublet Models (2HDM) [3]. One specific scenario considered is the
$m_h$-max scenario of the Minimal Supersymmetric Standard Model (MSSM), whose Higgs sector is described by
a type-II 2HDM [4].

In the Standard Model, the top quarks in $t\bar{t}$ events are expected to decay exclusively into a bottom quark
and a W boson. The W bosons can then decay hadronically into light quarks, or leptonically into an e, $\mu$, or $\tau$ lepton
and the associated neutrino. For charged Higgs boson masses, $m_{H^+}$, smaller than the top quark mass, $m_t$, the
dominant production mode at the LHC for $H^+$ is the decay $t \to H^+ b$ of one of the top quarks in $t\bar{t}$ events.
The charged Higgs boson decays via $H^+ \to \tau\nu$ almost exclusively in many scenarios [5]. Therefore, the charged
Higgs boson would be produced in place of a W boson for a fraction of top quark decays in $t\bar{t}$ events and
would then decay almost exclusively to $\tau$ leptons, as shown in the diagram in Figure 1. Thus, if the charged Higgs
boson exists, its preferential decay to $\tau$ leptons would cause an excess of events with $\tau$ leptons in the final state.

The $\tau$ lepton can decay either hadronically (BR $\approx$ 65%) or leptonically, into an e or $\mu$ (BR $\approx$ 35%). In
this study, only hadronically decaying $\tau$ leptons are considered, and any references to $\tau$ leptons
refer only to those decaying hadronically. The channel under study is described by one W (or charged
Higgs) boson decaying into a hadronically decaying $\tau$ and a neutrino, and the other W boson decaying hadronically.
This channel is referred to as "$\tau$+jets". Exclusion limits are calculated for the product of the branching ratios
$BR(t \to H^+ b)$ and $BR(H^+ \to \tau\nu)$ as a function of the charged Higgs boson mass, and also in the
$\tan\beta$-$m_{H^+}$ plane in the $m_h$-max scenario of the MSSM.



FIG. 1: Dominant production and decay of the light charged Higgs boson ($m_{H^+}$ smaller than the top quark mass, $m_t$)
at the LHC.

II. EVENT SELECTIONS

The $\tau$+jets final state is characterized by the presence of one hadronically decaying $\tau$ lepton, at least 4 jets,
and missing transverse energy ($E_T^{miss}$). To select $\tau$+jets events, the following cuts are applied:

1. Event preselection: 

(a) Data quality cuts. 

(b) $E_T^{miss}$ and $\tau$ trigger.

(c) At least 4 jets with $p_T > 20$ GeV.

2. Exactly one trigger-matched $\tau$ jet with $p_T > 35$ GeV.

3. No identified e or $\mu$.

4. $E_T^{miss} > 40$ GeV.

5. Reject events in which a large reconstructed $E_T^{miss}$ arises from the limited resolution of the energy
measurement, by requiring the significance of $E_T^{miss}$ to exceed 8 GeV$^{1/2}$.

6. At least one b-tagged jet.

7. Topologies consistent with a top decay are identified by requiring that the qqb candidate with the highest
$p_T$ value satisfy $m(qqb) \in [120, 240]$ GeV.
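
The selection above can be read as a sequence of pass/fail checks. The following Python sketch is illustrative only: the event record and all of its field names (`good_data_quality`, `passed_trigger`, `jet_pts`, `taus`, `n_electrons`, `n_muons`, `met`, `met_significance`, `n_btags`, `top_candidate_mass`) are hypothetical and are not taken from the ATLAS software. Momenta and energies are assumed to be in GeV, and the $E_T^{miss}$ significance and the mass of the highest-$p_T$ qqb candidate are assumed to be precomputed.

```python
def passes_tau_jets_selection(event):
    """Apply cuts 1-7 to a hypothetical event record (a plain dict)."""
    # 1. Preselection: data quality, trigger, at least 4 jets with pT > 20 GeV.
    if not (event["good_data_quality"] and event["passed_trigger"]):
        return False
    if sum(1 for pt in event["jet_pts"] if pt > 20.0) < 4:
        return False
    # 2. Exactly one trigger-matched tau jet with pT > 35 GeV.
    taus = [t for t in event["taus"] if t["trigger_matched"] and t["pt"] > 35.0]
    if len(taus) != 1:
        return False
    # 3. No identified electron or muon.
    if event["n_electrons"] + event["n_muons"] > 0:
        return False
    # 4. Missing transverse energy above 40 GeV.
    if event["met"] <= 40.0:
        return False
    # 5. Missing transverse energy significance above 8 GeV^(1/2).
    if event["met_significance"] <= 8.0:
        return False
    # 6. At least one b-tagged jet.
    if event["n_btags"] < 1:
        return False
    # 7. Highest-pT qqb candidate mass within [120, 240] GeV.
    return 120.0 <= event["top_candidate_mass"] <= 240.0


# Made-up example event, for illustration only.
example_event = {
    "good_data_quality": True,
    "passed_trigger": True,
    "jet_pts": [60.0, 45.0, 30.0, 25.0, 15.0],
    "taus": [{"pt": 42.0, "trigger_matched": True}],
    "n_electrons": 0,
    "n_muons": 0,
    "met": 55.0,
    "met_significance": 9.5,
    "n_btags": 1,
    "top_candidate_mass": 172.0,
}
```

Ordering the checks as in the list makes it straightforward to record a cut flow by counting the events that survive each step.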

The final discriminating variable of the $\tau$+jets analysis, which is used in limit setting, is $m_T(\tau, E_T^{miss})$, the
transverse mass of the $\tau$ lepton and $E_T^{miss}$. For most of the Standard Model background ($t\bar{t}$ and
W+jets) this quantity is related to the W boson mass, while for the charged Higgs boson signal it
is related to the charged Higgs boson mass.
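
The text does not spell out the transverse mass formula. A minimal sketch, assuming the standard definition for a visible object and missing transverse energy with masses neglected, $m_T = \sqrt{2\, p_T^{\tau} E_T^{miss} (1 - \cos\Delta\phi)}$, is given below. The function and argument names are illustrative.

```python
import math


def transverse_mass(tau_pt, tau_phi, met, met_phi):
    """Transverse mass of the tau candidate and the missing transverse energy.

    Assumes the standard massless definition; inputs in GeV and radians.
    """
    delta_phi = tau_phi - met_phi
    # max() guards against tiny negative values from floating-point rounding.
    return math.sqrt(max(0.0, 2.0 * tau_pt * met * (1.0 - math.cos(delta_phi))))
```

For a back-to-back configuration with $p_T^{\tau} = E_T^{miss} = 40$ GeV this gives 80 GeV, while a collinear configuration gives zero.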

III. DATA DRIVEN FAKE ESTIMATES 

An important aspect of the analysis is to estimate all of the backgrounds using data-driven
methods. A method is developed for each significant source of background in the analysis: jets
misidentified as $\tau$ leptons, electrons misidentified as $\tau$ leptons, QCD multi-jet events, and events with
true $\tau$ leptons in the final state. This document focuses on the methods for estimating the background
contributions from jets misidentified as $\tau$ leptons and from QCD multi-jet events; the other two methods
are briefly outlined.

Hadronically decaying $\tau$ leptons are challenging to identify. They decay into either one or three charged particles, a
neutrino, and associated neutral particles. The distinguishing features of hadronic $\tau$ decays in the detector are
a low track multiplicity (1 or 3 charged tracks) and a narrow deposition of energy in the calorimeters. These
hadronic $\tau$ leptons look very similar to jets in the detector, and electrons are also sometimes misidentified as $\tau$
leptons with 1 associated charged track. Understanding and estimating the misidentification of other objects as
$\tau$ leptons is one of the major challenges of the analysis.

A. Electron-to-$\tau$ Misidentification

Electrons can be misidentified as $\tau$ leptons with 1 associated charged track, but a dedicated electron veto
algorithm greatly reduces the misidentification probability. To estimate the remaining background to the
analysis from e-to-$\tau$ misidentification, a tag-and-probe method is used on $Z \to e^+e^-$ events from collision
data. The method is identical to that of [6], but uses a larger dataset. Instead of using the misidentification
probability directly, a scale factor is derived from the difference between the misidentification probability in
collision data and in simulated $Z \to e^+e^-$ Monte Carlo events. This factor is then used to scale simulated events in which
the reconstructed $\tau$ originates from a true electron. The contribution from events with electrons misidentified
as $\tau$ leptons is less than 5% of the total background.
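
How such a scale factor might be applied can be sketched as follows. This is an illustration under stated assumptions, not the procedure of [6]: the scale factor is taken here to be the ratio of the misidentification probability in data to that in simulation, which is a common convention but is not stated in the text, and the event fields `weight` and `tau_from_true_electron` are hypothetical.

```python
def e_to_tau_scale_factor(misid_prob_data, misid_prob_mc):
    """Data/simulation ratio of e-to-tau misidentification probabilities.

    Assumption: the scale factor is defined as a simple ratio.
    """
    if misid_prob_mc <= 0.0:
        raise ValueError("simulated misidentification probability must be positive")
    return misid_prob_data / misid_prob_mc


def scaled_electron_fake_yield(sim_events, scale_factor):
    """Sum the scaled weights of simulated events whose reconstructed tau
    is matched to a true electron (hypothetical truth-matching flag)."""
    return sum(
        ev["weight"] * scale_factor
        for ev in sim_events
        if ev["tau_from_true_electron"]
    )
```

Only events with a truth-matched electron are rescaled; all other simulated events are left out of this particular estimate.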

B. Jet-to-$\tau$ Misidentification

Events with jets misidentified as $\tau$ leptons are the dominant source of backgrounds with misidentified $\tau$
leptons. For this reason, a likelihood-based identification algorithm has been developed to distinguish $\tau$ leptons
from light quark jets, the objects most commonly misidentified as $\tau$ leptons. Even with the rejection power
of the identification algorithm, light quark jets are often misidentified as $\tau$ leptons. Less commonly, gluon jets or
b-jets in $t\bar{t}$ and W+jets events can also be misidentified as $\tau$ leptons. As the misidentification probability is
higher for light quark jets than for gluon jets [7], a data-driven method based on the measurement of the light quark
misidentification probability is taken as a conservative estimate of the contribution to the total background.

The backgrounds from jets misidentified as $\tau$ leptons are estimated using misidentification probabilities measured
in $\gamma$+jets events in collision data. A $\gamma$+jets event is defined by passing a single-$\gamma$ trigger and by having at
least one well-identified $\gamma$ and one jet. The leading $\gamma$ and jet are required to be back-to-back and balanced in $p_T$,
and any additional jet is required to have less than 20% of the $p_T$ of the leading $\gamma$.


FIG. 2: Jet-to-$\tau$ misidentification probability, evaluated for the tightest working point of
the $\tau$ identification algorithm, corresponding to a $\tau$ efficiency of 30%. The probabilities shown are for 1-prong and
3-prong $\tau$ leptons vs. $\tau$ $p_T$ (left), and for 1-prong and 3-prong $\tau$ leptons vs. $\tau$ $\eta$ (right).

The $\gamma$ is the tag object, and the jet opposite it is the probe object, with which the misidentification
probability is calculated. The denominator of this probability counts events which pass a loose $\tau$ selection. This
selection requires the $\tau$ candidate to have $p_T > 20$ GeV, to be seeded by a calorimeter-based algorithm, not
to be vetoed by the dedicated algorithm for rejecting electrons, and to have exactly one or three associated charged
tracks. The numerator counts events which pass all of these requirements and, in addition, a likelihood-based $\tau$
identification algorithm, which is optimized for accepting $\tau$ leptons and rejecting jets.
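
The ratio described above can be sketched in a few lines of Python. The probe record and its flags (`passes_loose_tau`, `passes_likelihood_id`) are hypothetical names, and the simple binomial uncertainty is an assumption made for illustration; the text does not state how the uncertainty is evaluated.

```python
import math


def misid_probability(probes):
    """Fraction of loose tau candidates (probe jets) that also pass the
    likelihood-based identification, with a simple binomial uncertainty."""
    # Denominator: probes passing the loose tau selection.
    denominator = [p for p in probes if p["passes_loose_tau"]]
    # Numerator: loose probes that additionally pass the likelihood ID.
    numerator = [p for p in denominator if p["passes_likelihood_id"]]
    n_den = len(denominator)
    if n_den == 0:
        raise ValueError("no probes pass the loose tau selection")
    prob = len(numerator) / n_den
    err = math.sqrt(prob * (1.0 - prob) / n_den)
    return prob, err
```

Probes that fail the loose selection enter neither the numerator nor the denominator, regardless of their identification flag.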

TABLE I: Background estimates from the jet misidentification probability compared to the corresponding Monte Carlo
prediction (ATLAS Preliminary).

Sample        Misid. Prob.    MC Prediction
$t\bar{t}$    2.8 ± 1.0       3.8 ± 0.6


The misidentification probability depends on both $p_T$ and $\eta$, and it is binned accordingly. In $p_T$, the
misidentification probability is split into six bins, with finer binning at low values and coarser binning at higher
$p_T$. In $\eta$, the probability is separated into three regions corresponding to the barrel region of the detector,
the transition region between the barrel and the end cap, and the end cap. The calculated misidentification
probabilities are shown in Figure 2, for one- and three-prong $\tau$ leptons as a function of $p_T$ and $\eta$. In each case,
the misidentification probability is displayed for the working point of the identification algorithm with a $\tau$
efficiency of 30%. This tight working point of the $\tau$ identification algorithm is the one employed for the $\tau$+jets analysis.

This misidentification probability is then applied to simulated events to predict the number of events arising
from jets misidentified as $\tau$ leptons. Table I shows the estimates based on the misidentification probability and
the expectation from simulated events for the $\tau$+jets final state. Events with jets misidentified as $\tau$ leptons
make a very small contribution to the total background.
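
Applying a binned misidentification probability to simulated events amounts to a table lookup followed by a weighted sum. The sketch below is illustrative: the bin edges are placeholders (the text specifies six $p_T$ bins and three $\eta$ regions but not their boundaries), and the candidate fields `pt`, `eta`, `n_prong`, and `weight` are hypothetical.

```python
from bisect import bisect_right

# Placeholder lower bin edges; the actual boundaries are not given in the text.
PT_EDGES = [20.0, 30.0, 40.0, 60.0, 80.0, 120.0]  # six pT bins, last one open-ended
ETA_EDGES = [0.0, 1.37, 1.52]  # barrel, transition region, end cap


def bin_index(value, lower_edges):
    """Index of the bin containing value; values below the first edge go to bin 0."""
    return max(0, bisect_right(lower_edges, value) - 1)


def predicted_fake_yield(candidates, fake_prob):
    """Sum of event weights times the misidentification probability.

    fake_prob[n_prong][i_pt][i_eta] holds the probability for 1- or 3-prong
    candidates in each (pT, |eta|) bin.
    """
    total = 0.0
    for cand in candidates:
        i_pt = bin_index(cand["pt"], PT_EDGES)
        i_eta = bin_index(abs(cand["eta"]), ETA_EDGES)
        total += cand["weight"] * fake_prob[cand["n_prong"]][i_pt][i_eta]
    return total
```

Keeping the 1-prong and 3-prong tables separate mirrors the way the probabilities are presented in Figure 2.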



C. QCD Multi-jet Estimate 



The background from QCD multi-jet events is not well modeled in Monte Carlo, so it is estimated using a
fitting method. An inverted selection is defined that is orthogonal to the baseline selection. The underlying
assumption of the method is that the shape of the $E_T^{miss}$ distribution is the same for both the baseline and the
inverted selection. This assumption is supported by the comparison of the $E_T^{miss}$ distributions for the baseline and
inverted selections in Figure 3. Differences between the two distributions are taken into account in the systematic
uncertainties of the method.

FIG. 3: Comparison of the $E_T^{miss}$ distribution for the inverted and baseline $\tau$+jets selections early in the cut flow. The inverted
selection inverts the $\tau$ identification cut and the b-tag requirement.



The orthogonal selection is defined by inverting the $\tau$ selection and the b-tag requirement. The $\tau$ lepton
must pass a loose identification cut while failing a tighter cut, and there must be no b-tagged jets in the event.
The $E_T^{miss}$ distributions are shown for the inverted and baseline selections, after subtracting the
non-QCD backgrounds estimated from simulated events, in Figure 3. This comparison is made early in the cut
flow, in order to have sufficient purity and statistics. The two distributions match fairly
closely, and the difference between them is taken as a systematic uncertainty.

The inverted selection is taken as a model for the shape of the QCD background. The two shapes used for the
fit are the $E_T^{miss}$ distribution for this model and the $E_T^{miss}$ distribution of the non-QCD events from simulation.
The two free parameters of the fit are the overall normalization, which is fitted to the data, and the fraction of
QCD multi-jet events. The result of the fit is shown in the left plot of Figure 4. The right plot
of Figure 4 shows the fit result applied to estimate the QCD contribution to the $m_T(\tau, E_T^{miss})$ distribution.
$m_T(\tau, E_T^{miss})$, the transverse mass of the $\tau$ lepton and $E_T^{miss}$, is the final discriminating variable for the $\tau$+jets
analysis, and it is used in the limit setting process.
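
A two-template fit with an overall normalization and a QCD fraction can be sketched as below. This is a simplified stand-in, not the fit used in the analysis: it performs an unweighted least-squares fit to binned data (the actual fit method is not specified in the text), and it assumes both templates are normalized to unit area. The function name and arguments are illustrative.

```python
def fit_qcd_fraction(data, qcd_shape, nonqcd_shape):
    """Fit data_i ~ N * (f * qcd_i + (1 - f) * nonqcd_i) by least squares.

    Both shapes must be normalized to unit area. Returns (N, f), where N is
    the overall normalization and f is the QCD multi-jet fraction.
    """
    # The model is linear in a = N*f and b = N*(1-f); solve the 2x2 normal equations.
    s_qq = sum(q * q for q in qcd_shape)
    s_nn = sum(n * n for n in nonqcd_shape)
    s_qn = sum(q * n for q, n in zip(qcd_shape, nonqcd_shape))
    s_qd = sum(q * d for q, d in zip(qcd_shape, data))
    s_nd = sum(n * d for n, d in zip(nonqcd_shape, data))
    det = s_qq * s_nn - s_qn ** 2
    if abs(det) < 1e-12:
        raise ValueError("templates are degenerate; the fraction cannot be fitted")
    a = (s_qd * s_nn - s_nd * s_qn) / det
    b = (s_nd * s_qq - s_qd * s_qn) / det
    norm = a + b
    return norm, a / norm
```

The fit can only separate the two components if the templates differ in shape, which is why a degenerate pair of templates raises an error.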



D. Embedding Method 



Backgrounds with true $\tau$ leptons are estimated in a data-driven way using a method called embedding. Events
with correctly identified $\tau$ leptons are a major background in the $\tau$+jets analysis. First, $\mu$+jets events are selected
with high purity from data. The muon signature (track and calorimeter deposition) is removed from the event,
and a simulated $\tau$ with the same properties, but rescaled, is embedded in the event. Then, the full baseline
selection is run over this embedded sample to estimate the total contribution to the background that results
from events with true $\tau$ leptons in the final state. This method has been validated in $\tau$+jets events using early
ATLAS data [8].

FIG. 4: Left: Result of the QCD fitting method for $\tau$+jets, shown for the $E_T^{miss}$ distribution. Right: Fit result applied to the
$m_T(\tau, E_T^{miss})$ distribution.



IV. RESULTS 

FIG. 5: Final plot of $m_T(\tau, E_T^{miss})$ for the $\tau$+jets analysis, using the data-driven methods to estimate the background
contributions. For reference, the charged Higgs boson signal and signal+background contributions are shown for a charged Higgs
boson mass of 130 GeV and $BR(t \to H^+ b) = 0.1$.



The final $m_T(\tau, E_T^{miss})$ distribution is shown in Figure 5, with all background contributions taken from data-driven
methods. Also overlaid for reference are the signal and signal+background distributions for the case of
a charged Higgs boson with a mass of 130 GeV and $BR(t \to H^+ b) = 0.1$. In the signal+background case, the
background is the Standard Model background with branching ratios altered by the hypothetical existence
of a charged Higgs boson with $BR(t \to H^+ b) = 0.1$.
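
The effect of a non-zero $BR(t \to H^+ b)$ on the composition of $t\bar{t}$ events can be illustrated with simple arithmetic. The sketch below assumes that each top quark decays independently and only to $Wb$ or $H^+ b$; the function name and the dictionary keys are illustrative.

```python
def ttbar_decay_fractions(br_t_to_hb):
    """Fractions of ttbar events with two W bosons (WW), one W and one
    charged Higgs boson (WH), or two charged Higgs bosons (HH).

    Assumes BR(t -> Wb) + BR(t -> H+ b) = 1 and independent top decays.
    """
    b = br_t_to_hb
    return {
        "WW": (1.0 - b) ** 2,
        "WH": 2.0 * b * (1.0 - b),
        "HH": b ** 2,
    }
```

Under these assumptions, $BR(t \to H^+ b) = 0.1$ leaves 81% of $t\bar{t}$ events with two W bosons, while 18% contain one charged Higgs boson and 1% contain two.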

Exclusion limits are set on the product of the branching ratios $BR(t \to H^+ b)$ and $BR(H^+ \to \tau\nu)$ as a function
of the charged Higgs boson mass, and also in the $\tan\beta$-$m_{H^+}$ plane in the $m_h$-max scenario of the
MSSM, by applying the CL$_s$ procedure [9, 10]. A profile likelihood ratio [11] is used with $m_T(\tau, E_T^{miss})$ as
the discriminating variable. As shown in Figure 6, values of the product of branching ratios $BR(t \to H^+ b)$ and
$BR(H^+ \to \tau\nu)$ larger than 0.03-0.10 are excluded in the mass range 90-160 GeV, significantly
extending the limits from other experiments. Results interpreted in the context of the $m_h$-max scenario of the
MSSM are shown in Figure 7. Values of $\tan\beta$ above 22-30 (depending on $m_{H^+}$) can be excluded in the mass
range 90 GeV < $m_{H^+}$ < 140 GeV.
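
The idea behind the CL$_s$ criterion can be illustrated with a toy example. The sketch below is a deliberate simplification: it treats a single-bin counting experiment with known signal and background expectations and no systematic uncertainties, whereas the analysis uses a profile likelihood ratio on the full $m_T(\tau, E_T^{miss})$ distribution. All names are illustrative.

```python
import math


def poisson_cdf(n_obs, mean):
    """P(n <= n_obs) for a Poisson distribution with the given mean."""
    return sum(
        math.exp(-mean) * mean ** k / math.factorial(k) for k in range(n_obs + 1)
    )


def cls_counting(n_obs, signal, background):
    """CL_s = CL_{s+b} / CL_b for a single-bin counting experiment."""
    cl_sb = poisson_cdf(n_obs, signal + background)
    cl_b = poisson_cdf(n_obs, background)
    return cl_sb / cl_b
```

A signal hypothesis is regarded as excluded at 95% CL when CL$_s$ falls below 0.05. Dividing by CL$_b$ protects against excluding signals to which the measurement has little sensitivity.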

FIG. 6: Expected and observed 95% CL exclusion limits for charged Higgs boson production from top quark decays as a
function of $m_{H^+}$, in terms of $BR(t \to H^+ b) \times BR(H^+ \to \tau\nu)$. For comparison, the best limit provided by the Tevatron
experiments is shown [12].




FIG. 7: Limit for charged Higgs boson production in top quark decays in the $m_{H^+}$-$\tan\beta$ plane. Results are shown
for the MSSM scenario $m_h$-max.